# COCO Dataset

Yolov10x
YOLOv10x is the latest version of the YOLO series, focusing on real-time end-to-end object detection, offering higher detection accuracy and faster inference speed.
Object Detection
Y
jameslahm
1,145
41
Yolov10l
YOLOv10 is a real-time end-to-end object detection model developed by the Tsinghua University team, based on the latest improved version of the YOLO series.
Object Detection
Y
jameslahm
186
3
Yolov10b
YOLOv10 is a real-time end-to-end object detection model developed by the Tsinghua University team, representing the latest improvement in the YOLO series.
Object Detection
Y
jameslahm
97
2
Yolov10m
YOLOv10 is a real-time end-to-end object detection model proposed by Tsinghua University, known for its efficiency and accuracy.
Object Detection
Y
jameslahm
1,003
7
Yolov10s
YOLOv10 is a real-time object detection model that achieves efficient and overhead-free object detection by eliminating post-processing steps such as Non-Maximum Suppression (NMS).
Object Detection
Y
kadirnar
15
0
Mask2former Swin Tiny Coco Panoptic
Other
Mask2Former is a Transformer-based unified image segmentation model supporting instance segmentation, semantic segmentation, and panoptic segmentation tasks, utilizing masked attention mechanism to enhance performance
Image Segmentation Transformers
M
facebook
4,538
8
Mask2former Swin Small Coco Panoptic
Other
A small-scale version of Mask2Former based on Swin backbone network, optimized for panoptic segmentation tasks on the COCO dataset
Image Segmentation Transformers
M
facebook
240
1
Mask2former Swin Large Coco Panoptic
Other
A large-scale version of Mask2Former based on the Swin backbone network, specifically trained for panoptic segmentation tasks on the COCO dataset
Image Segmentation Transformers
M
facebook
37.67k
30
Mask2former Swin Base Coco Panoptic
Other
The Mask2Former model based on the Swin backbone network, trained on the COCO panoptic segmentation dataset, adopts a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.
Image Segmentation Transformers
M
facebook
45.01k
14
Mask2former Swin Large Coco Instance
Other
Mask2Former is a Transformer-based unified image segmentation model, utilizing a Swin-Large backbone and fine-tuned on the COCO dataset, specializing in instance segmentation tasks.
Image Segmentation Transformers
M
facebook
37.31k
6
Oneformer Coco Dinat Large
MIT
A unified single Transformer architecture for image segmentation, supporting three major tasks: semantic segmentation, instance segmentation, and panoptic segmentation
Image Segmentation Transformers
O
shi-labs
38
7
Yolos Small 300
Apache-2.0
A small-sized YOLOS model fine-tuned on the COCO 2017 object detection dataset, utilizing Vision Transformer architecture for efficient object detection
Object Detection Transformers
Y
hustvl
86
6
Yolos Small Dwr
Apache-2.0
A YOLOS model fine-tuned on the COCO 2017 object detection dataset, utilizing a Vision Transformer architecture, suitable for object detection tasks.
Object Detection Transformers
Y
hustvl
33
4
Yolos Small
Apache-2.0
A vision Transformer (ViT)-based object detection model trained with DETR loss function, achieving excellent performance on the COCO dataset.
Object Detection Transformers
Y
hustvl
154.46k
63
Yolos Tiny
Apache-2.0
YOLOS model fine-tuned on the COCO 2017 object detection dataset, utilizing Vision Transformer architecture for efficient object detection.
Object Detection Transformers
Y
hustvl
144.58k
266
Detr Resnet 50 Panoptic
Apache-2.0
DETR is an end-to-end object detection model based on Transformer architecture, using ResNet-50 as the backbone network, trained on the COCO dataset, and supports object detection and panoptic segmentation tasks.
Image Segmentation Transformers
D
facebook
9,586
137
Maskformer Swin Large Coco
Other
Large-scale MaskFormer model based on Swin backbone network, unifying instance/semantic/panoptic segmentation tasks
Image Segmentation Transformers
M
facebook
849
24
Maskformer Swin Small Coco
Other
A small MaskFormer model based on the Swin backbone network, trained on the COCO dataset for panoptic segmentation tasks.
Image Segmentation Transformers
M
facebook
2,293
3
Maskformer Swin Tiny Coco
Other
A panoptic segmentation model trained on the COCO dataset, using a unified paradigm to handle instance/semantic/panoptic segmentation tasks
Image Segmentation Transformers
M
facebook
301
6
Maskformer Swin Base Coco
Other
A panoptic segmentation model based on the Swin backbone network, trained on the COCO dataset, unifying instance/semantic/panoptic segmentation tasks
Image Segmentation Transformers
M
facebook
3,855
24
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase